A general framework for reinforcement learning
نویسنده
چکیده
In this artide we p1'Opose a geneml framework for sequential dedsion making. The fi'amework is baSf.d on the observation that the. derication of the optimal behaviour ttnde.T various decision criteria follows the. same patte!"n: the cost of policies can be decomposed into the successive applicatlOn of an opemtor that defines the related dynamic programming algordhm and this ope.mtor descnbes complete.ly the structure of the decision problem. vVe take this mapping (the 80 called one step lookahead (OLA) cost mapping) as our starting point. This enables Ihe unified Irealmenl of various decision ,Tile ria (e.g. Ihe expected value (Tilerion OJ' Ihe worsl-case 'Tile1'ion). The main resuli 0./ Ihis arlicle says Ihal Wldt1" minimal condilions oplimal slalionm'Y policies are greedy "W.l·.I. Ihe oplimal cosl Junelion and vice versa. Based on Ihis n:suli "We ./eel Ihal former' resttlis on r'emforcemenl learning can be Imnsfel"1"td 10 olhel' decision (Tileria pl'ovided Ihal lhe decision criltTion is decomposable by an apPl"Opj'iale mapping.
منابع مشابه
Reinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملReinforcement Learning in Neural Networks: A Survey
In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...
متن کاملHierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents
This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. In other words, the agents are assumed t...
متن کاملA .net Reinforcement Learning Platform for Multiagent Systems
Reinforcement learning is a convenient way of allowing the agents to autonomously explore and learn the best action sequences that maximize their overall value, based on successive rewards received from the environment. Among other similar libraries and platforms, the reinforcement platform presented here is especially designed to be used with the .NET framework and provides a general support f...
متن کاملMultiagent Reinforcement Learning in Stochastic Games
We adopt stochastic games as a general framework for dynamic noncooperative systems. This framework provides a way of describing the dynamic interactions of agents in terms of individuals' Markov decision processes. By studying this framework, we go beyond the common practice in the study of learning in games, which primarily focus on repeated games or extensive-form games. For stochastic games...
متن کاملDeep Reinforcement Learning as Foundation for Artificial General Intelligence
Deep machine learning and reinforcement learning are two complementing fields within the study of intelligent systems. When combined, it is argued that they offer a promising path for achieving artificial general intelligence (AGI). This chapter outlines the concepts facilitating such merger of technologies and motivates a framework for building scalable intelligent machines. The prospect of ut...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995